Dynamic Locomotion Skills for Obstacle Sequences Using Reinforcement Learning

نویسندگان

  • X. B. Peng
  • G. Berseth
  • M. van de Panne
چکیده

Most locomotion control strategies are developed for flat terrain. We explore the use of reinforcement learning to develop motor skills for the highly dynamic traversal of terrains having sequences of gaps, walls, and steps. Results are demonstrated using simulations of a 21-link planar dog and a 7-link planar biped. Our approach is characterized by: non-parametric representation of the value function and the control policy; value iteration using batched positive-TD updates; localized epsilon-greedy exploration; and an optimized state distance metric. The policies are progressively improved using repeated iterations of epsilon-greedy exploration and value iteration.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...

متن کامل

Improving Reinforcement Learning of an Obstacle Avoidance Behavior with Forbidden Sequences of Actions

This paper is concerned with the improvement of reinforcement learning through the use of forbidden sequences of actions. A given reinforcement function can generate multiple effective behaviors. Each behavior is effective only considering the cumulative reward over time. It may not be the behavior expected by the designer. In this case, the usual solution is to modified the reinforcement funct...

متن کامل

Locomotion Planning with 3D Character Animations by Combining Reinforcement Learning Based and Fuzzy Motion Planners

Motion and locomotion planning have a wide area of usage in different fields. Locomotion planning with premade character animations has been highly noticed in recent years. Reinforcement Learning presents promising ways to create motion planners using premade character animations. Although RL-based motion planners offer great ways to control character animations but they have some problems that...

متن کامل

Reinforcement Learning Approaches for Locomotion Planning in Interactive Applications

Locomotion is one of the most important capabilities for virtual human agents in interactive applications, because it allows them to navigate their environment. Locomotion controllers in interactive applications typically work by blending and concatenating clips of keyframe or motion capture motion that represent individual locomotion actions (e.g. walk cycles), to generate sequences of natural...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015